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Description 

Technical Field 

[0001 ] This invention is in the field of lymphadenopa- 
thy syndrome and/or acquired immune deficiency syn- 
drome. 

Background Art 

[0002] With the discovery of human T-cell lympho- 
tropic virus-l (HTLV-I) as an infectious agent in humans, 
it was established that retroviruses could infect humans 
and could be the etiological agent of disease. After HT- 
LV-I was established, a second retrovirus of the same 
family, HTLV-I I was found in a hairy cell leukemia estab- 
lished strain. Since that time, other human retroviruses 
have been isolated which are associated with lymphad- 
enopathy syndrome (LAS) and/or acquired immune de- 
ficiency syndrome (AIDS) victims. Various retroviruses 
have been isolated from individuals with AIDS (some- 
times called HTLV-III) or LAS (sometimes called LAV). 
See for example, Barre-Sinoussi, et al, Science (1983) 
220:868-871 and Montagnier, et al, Cold Spring Harbor 
Symposium (1 984) in press; Vilmer, et al, Lancet (1 984) 
1:753, Popovic, et al, Science (1984) 224:497 and Gal- 
lo, et al, Science (1984) 224:500. A comparison of HT- 
LV-III and LAV may be found in Feorino, et al, (1984), 
supra. See also, Klatzman, et al, Science (1984) 225 : 
59-62, Montagnier, et al, ibid (1984) 63-66, and the ref- 
erences cited therein for a survey of the field. A general 
discussion of the T-cell leukemia viruses may be found 
in Marx, Science (1984) 224:475-477. Levy, et al, Scj: 
ence (1984) 225:840-842 report the isolation of ARV 
(AIDS-associated retroviruses). 
[0003] For the purposes of this application, these vi- 
ruses (HTLV-III, LAV, and ARV) will be generically re- 
ferred to as human T-cell lymphotropic retrovirus (hT- 
LR). The hTLRs may be shown to be of the same class 
by being similar in their morphology, serology, reverse 
transcriptase optima and cytopathology, as identified in 
the above references. For example, the reverse tran- 
scriptase prefers Mg +2 , and has a pH optima of about 
7.8. 

Disclosure of the Invention 

[0004] The invention is defined in the claims. 

Brief Description of the Drawings 

[0005] 

Figure 1 is a restriction map of proviral DNA (ARV- 
2). 

Figure 2 is the nucleotide sequence of ARV-2(9B). 
The amino acid sequences for the products of the 
gag, pol, and env genes are indicated. The U3, R, 



and U5 regions of the LTRs are also designated. 
The cap site is position +1. A 3 bp inverted repeat 
at the ends of the LTR, the TATA box at position -29, 
the sequence complementary to the 3'-end of the 
tRNA'ys at position 183, and the polyadenylation 
signal at position 9174 are underlined. The over- 
lines indicate the amino sequences determined 
from virion proteins. The nucleotides at the begin- 
ning of each line are numbered, and the amino ac- 
ids at the end of each line are indicated. 

Modes for Carrying Out the Invention 

[0006] The hTLR DNA sequences are synthesized at 
least in part and may be used for expression of polypep- 
tides which may be a precursor protein subject to further 
manipulation by cleavage, or a complete mature protein 
or fragment thereof. The smallest sequence of interest, 
so as to provide a sequence encoding an amino acid 
sequence capable of specific binding to a receptor, e.g., 
an immunoglobulin, will be 21 bp, usually at least 45 bp, 
exclusive of the initiation codon. The sequence may 
code for any greater portion of or the complete polypep- 
tide, or may include flanking regions of a precursor 
polypeptide, so as to include portions of sequences or 
entire sequences coding for two or more different ma- 
ture polypeptides. The sequence will usually be less 
than about 5 kbp, more usually less than about 3 kbp. 
[0007] Sequences of particular interest having open 
reading frames (Figure 5) define the structural genes for 
the gag proteins (p16 and p25) and the env protein . It 
is to be understood that the above sequences may be 
spliced to other sequences present in the retrovirus, so 
that the 5*-end of the sequence may not code for the N- 
terminal amino acid of the expression product. The 
splice site may be at the 5'-terminus of the open reading 
frame or internal to the open reading frame. The initia- 
tion codon for the protein may not be the first codon for 
methionine, but may be the second or third methionine, 
so that employing the entire sequence indicated above 
may result in an extended protein. However, for the gag 
and env genes there will be proteolytic processing in 
mammalian cells, which processing may include the re- 
moval of extra amino acids. 

[0008] In isolating the different domains the provirus 
may be digested with restriction endonucleases, the 
fragments electrophoresed and fragments having the 
proper size and duplexing with a probe, when available, 
are isolated, cloned in a cloning vector, and excised from 
the vector. The fragments may then be manipulated for 
expression. Superfluous nucleotides may be removed 
from one or both termini using Ba131 digestion. By re- 
striction mapping convenient restriction sites may be lo- 
cated external or internal to the coding region. Primer 
repair or in vitro mutagenesis may be employed for de- 
fining a terminus, for insertions, deletion, point or multi- 
ple mutations, or the like, where codons may be 
changed, either cryptic or changing the amino acid, re- 
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striction sites introduced or removed, or the like, where 
the gene has been truncated, the lost nucleotides may 
be replaced using an adaptor. Adaptors are particularly 
useful for joining coding regions to ensure the proper 
reading frame. 

[0009] The env domain of the hTLR genome can be 
obtained by digestion of the provirus with Eco RI and 
Kpnl and purification of a 3300 base pair (bp) fragment, 
which fragment contains about 400 bp of 5' non-coding 
and about 200 bp of 3' non-coding region. Three differ- 
ent methionines coded for by the sequence in the 5' end 
of the open reading frame may serve as translational 
initiation sites. 

[0010] Digestion of proviral sequences with Sad and 
EcoRV provides a fragment of about 2300 bp wh ich con- 
tains the gag domain and a second small open reading 
frame towards the 3' end of the gag region. The gag do- 
main is about 1500 bp and codes for a large precursor 
protein which is processed to yield proteins of about 
25.000 (p25), 16.000 (p16) and 12,000 (p12) daltons. 
Digestion with Sacl and Bglll may also be used to obtain 
exclusively the gag domain with p12, p25 and partial p1 6 
regions. 

[0011] The hTLR DNA sequences may be labeled 
with isotopic or non-isotopic labels or markers and be 
used as DNA probes to detect the presence of native 
hTLR nucleotide sequences in samples suspected of 
containing same. 

[0012] The following examples are offered by way of 
illustration and not by way of limitation. 

EXPERIMENTAL 

1 . AIDS related virus-2 (ARV-2) purification and 
preparation of viral RNA . 

[0013] HUT-78 cells infected with ARV-2 (ATCC Ac- 
cession No. CRL B597, deposited on August 7, 1984) 
were obtained from Dr. Jay Levy, University of Califor- 
nia, San Francisco. Cultures were grown for two weeks 
in RPM1 medium with 10% fetal calf serum. Cultures 
were centrifuged at 2 Krpm for 1 hr at 4°C using a SW- 
2B rotor. The pellet, containing the virus, was resus- 
pended in 10 mM Tris-HCI, pH 7.5 on ice. The resus- 
pended pellet was treated with 10 u.g of DNase (Boe- 
hringer Mannhein) and was layered onto a linear su- 
crose gradient (15-50% in 10 mM Tris-HCI, pH 7.5, 1 
mM EDTA, 20 mM NaCI). The gradient was spun at 34 
Krpm for 4 hr at 4°C, in SW-41 rotor. Five 2.5 ml fractions 
were collected and an aliquot of each was electro- 
phoresed in a 1% agarose, 5 mM methyl mercury hy- 
droxide gel (Bailey and Davidson. Anal Biochem (1 976) 
70:75-85) to determine which contained the 9 kb viral 
RNA. The fraction containing the viral RNA was diluted: 
to 10 ml in 10 mM Tris-HCI, pH 7.5, 1 mM EDTA and 
was centrifuged at 34 Krpm for 2 hr at 4°C. The pellet 
was resuspended in 20 mM Tris-HCI, pH 7.6, 10 mM 
EDTA, 0.1% SDS, and 200 ug/ml proteinase K. Incuba- 



tion was carried out for 1 5 min at room temperature. The 
mixture was extracted with phenol and the aqueous 
phase was made 400 mM NaCI and precipitated with 
ethanol. The pellet was resuspended in water and 

5 stored at -70°C. 

[0014] To purify the viral RNA from the nucleic acid 
pellet obtained as described above, a sample was elec- 
trophoresed in a low-melting 1% agarose gel containing 
5 mM Methyl mercury hydroxide. After electrophoresis, 

io the gel was stained with 0.1 % ethidium bromide and nu- 
cleic acid bands were visualized under UV light. The re- 
gion corresponding to 9 kb was cut from the gel and the 
agarose was melted at 70°C for 2 to 3 min in three vol- 
umes of 0.3 M NaCI, 10 mM Tris, pH 7.5, 1 mM EDTA. 

is The mixture was extracted with an equal volume of phe- 
nol. The aqueous phase was reextracted with phenol 
and was precipitated with ethanol. The pellet was 
washed with cold 95% ethanol, air dried, resuspended 
in water and stored at -70°C until use. One hundred ml 

20 of culture medium yielded 0.5 to 1 jig of purified RNA. 

2. Synthesis of labeled homologous viral probe . 

[0015] A 32 P-labeled cDNA was made to the gel puri- 

25 fled viral RNA using random primers (calf thymus prim- 
ers) prepared as described in Maniatis, et al, A Labora- 
tory Manual , Cold Spring Harbor, NY, 1 982. The reaction 
mixture contained 2 uJ of 0.5 M MgCI 2 ; 5 uJ of 0.1 M 
dithiothreitol; 2.5 jxl each of 10 mM dATP, 10 mM dGTP 

30 and 10 mM dTTP; 2.5 pJ calf thymus primer (100A2 60 / 
ml); 0.5 u.g viral RNA: 5 u.l of actinomycin D (200 u.g/ml); 
10 uJ of 32 P-dCTP (> 3000 Ci/mmole, 1 mCi/ml) and 1 
ixl of AMV reverse transcriptase (17 units/uJ) in a 50 uJ 
reaction volume. The reaction was incubated for 1 hr at 

35 37°C. The probe was purified away from free nucle- 
otides by gel filtration using a Sephadex G50 column. 
The void volume was pooled, NaCI was added to a final 
concentration of 400 mM and carrier single-stranded 
DNA to 100 fig/ml, and the cDNA was precipitated with 

40 ethanol. The pellet was resuspended in water and 
incorporated 32 P counts were determined. 

3. Detection of ARV sequences in polyA+ RNA prepared 
from infected HUT-78 cells . 

45 

[0016] PolyA+ RNA was prepared from HUT-78 cells 
infected with ARV-2, ARV-3 or ARV-4 (three different 
isolates from three different AIDS patients) and from un- 
infected HUT-78 cells. The polyA, RNA was electro- 

50 phoresed on 1% agarose gels containing 5 mM methyl 
mercury hydroxide (Bailey and Davidson, supra), was 
transferred to nitrocellulose filters, and hybridized with 
the homologous probe prepared as described in Section 
2. Hybridizations were carried out in 50% formamide, 3 

55 x SSC at 42°C. Washes were at 50°C in 0.2 x SSC. A 
9 kbp band was present in all three samples of infected 
HUT-7B cells. This band was absent in polyA+ from un- 
infected cells. 
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4. Detection of ARV sequences in infected and non- 
infected HUT-78 cells . 

[0017] High molecular weight DNA (chromosomal) 
was prepared from cultures of HUT-78 cells infected 5 
with ARV-2 and from non-infected HUT-78 cells follow- 
ing the procedure of Luciw, et al, Molec and Cell Biol 
(1984) 4:1260-1269. The DNA was digested with re- 
striction enzyme(s), electrophoresed in 1% agarose 
gels and blotted onto nitrocellulose following the proce- 
dure described by Southern, (1975), supra. Blots were 
hybridized with the 32 P-labeled probe (10 6 cpm/blot) in 
a mixture containing 50% formamide, 3 x SSC, 10 mM 
Hepes, pH 7.0, 100 u.g/ml denatured carrier DNA, 100 
u.g/ml yeast RNA and 1 x Denhardt's for 36 hr at 42°C. 
Filters were washed once at room temperature in 2 x 
SSC and twice at 42°C in 0.2 x SSC, 0.1 % SDS. Filters 
were air dried and exposed to X-Omat film using an in- 
tensifying screen. 

[0018] The homologous 32 P-probe to ARV-2 hybrid- 
ized specifically to two bands in the DNA from infected 
cells restricted with Sac l. These bands were absent 
when DNA of non-infected cells was used, indicating 
that the probe is hybridizing specifically to infected cells 
presumably to the provirus integrated in the chromo- 
somal DNA. The molecular weight of the bands is ap- 
proximately 5 kb and 3 kb. 

[0019] In order to determine if different enzymes 
would cut the proviral sequence, several other restric- 
tion digestions of the cell DNA were carried out using 
EcoRI, Sph l or Kpn l or double digestions using two of 
them. Southern results show specific bands hybridizing 
when DNA of infected cells is used. Figure 1 shows a 
schematic map of the positions of restriction enzyme 
sites in the proviral sequence, and indicates fragment 
sites. 

5. Cloning of proviral ARV-2 DNA . 

[0020] High molecular weight cell DNA from infected 
HUT-78 cells was prepared following the procedure of 
Luciw, et al, supra. The DNA was digested with EcoRI, 
which cuts once in the previrus, centrifuged in a sucrose 
gradient and fractions corresponding to 8-15 kb were 
pooled, dialyzed and concentrated by ethanol precipita- 
tion. The bacteriophage X derivative cloning vector, EM- 
BL-4 (Karn, et al, Methods Enzymol (1983) 101:3-19) 
was digested to completion with a mixture of Eco RI, 
Bam HI and Sail restriction enzymes and the DNA then 
deproteinized by phenol-chloroform extraction, precipi- 
tated with cold ethanol and resuspended in ligation buff- 
er. The EMBL-4 phage DNA and Eco RI digest of cellular 
DNA were mixed and ligated and the resultant recom- 
binant phage genomes packaged in vitro . After phage 
infection of X-sensitive E. coli (DP50supF), about 
500,000 phage plaques were transferred onto nitrocel- 
lulose filters, DNA was fixed and the filters were 
screened with a homologous 32 P-probe prepared as de- 



scribed in Section 2. Eleven recombinant phage out of 
500,000 phage annealed in the initial double-lift screen- 
ing method (Maniatis, et al, Molecular Cloning, A Labo- 
ratory Manual , NY, 1 982) to viral cDNA probe, and these 
were further plaque-purified and propagated in large liq- 
uid cultures for preparation of recombinant DNA. 
Plaque-purified phage containing ARV DNA were prop- 
agated in liquid culture in E. coli DPSOsupF; phage par- 
ticles were harvested and banded in CsCI gradients and 
recombinant phage DNA was prepared by phenol ex- 
traction followed by ethanol precipitation (Maniatis, et 
al, supra). One u.g of purified phage DNA was digested 
with restriction enzymes, electrophoresed on 1% agar- 
ose gels, and visualized with ethidium bromide under 
ultraviolet light. The DNA from these gels was trans- 
ferred to nitrocellulose and annealed with viral cDNA 
probe. 

[0021] One of the 11 phage, designated X ARV-2 (9B). 
was deposited at the ATCC on 25 January 1985 and 
given Accession No. 40158. X ARV-2(9B) contained an 
insertion of full-length proviral DNA along with flanking 
cell sequences. Digestion of X ARV-2(9B) DNA with 
cl yielded viral DNA fragments of 3.8 kb and 5.7 kb. 
Eco RI digestion of X ARV-2(9B) produced virus contain- 
ing DNA species at 6.4 kb and 8.0 kb; a double digest 
of Sac l and Eco RI gave viral DNA fragments at 3.8 kb 
and 5.4 kb. This pattern is consistent with that of a pro- 
virus linked to cell DNA. 

[0022] In addition to X ARV-2(9B). phage was ob- 
tained that (1 ) possessed the left half of the viral genome 
from the Eco RI site in viral DNA extending into flanking 
cell DNA (*ARV-2(8A)) and (2) phage that had the right 
half of the viral genome (XARV-2(7D)) from the Eco RI 
site in viral DNA extending into flanking cell DNA. Bac- 
teriophages XARV-2(7D) (right) and XARV-2(8A).(left) 
were deposited at the ATCC on October 26, 1984 and 
given Accession Nos. 40143 and 40144, respectively. 

6. Polymorphism . 

[0023] To measure the relatedness of independent 
ARV isolates, restriction enzyme digests of DNA from 
HUT-78 cells infected with ARV-3 and ARV-4 were an- 
alyzed with the probe made from cloned ARV-2 DNA. 
The Sacl digest of ARV-3 DNA was similar to that of 
ARV-2 whereas the Hindi II digests displayed different 
patterns. The Sac l digest and the Pstl digest of ARV-4 
DNA differed from the corresponding digests of ARV-2 
DNA. The intensity of the annealing signals obtained 
with ARV-3 and ARV-4 samples was much lower (about 
10-fold less) than that for ARV-2 DNA probably as a re- 
sult of the fact that fewer cells were infected in the ARV- 
3 and ARV-4 cultures. The viral-specific DNA fragments 
produced by Sac l treatment of ARV-3 and ARV-4 DNA 
totaled 9.0-9.5 kbp, a value similar to that of ARV-2 and 
in consonance with the RNA genome sizes. 
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7. Sequencing of proviral DNA . 

[0024] Fragments or subfragments of ARV-2 DNA 
from X phage 9B were prepared and cloned into M13 
according to conventional procedures (Maniatis, et al, 
supra). Sequencing was performed according to 
Sanger, et al, Proc Natl Acad Sci USA (1 977) 74:5463, 
using the universal M13 primer or chemically synthe- 
sized primers complementary to ARV-2 sequence. The 
sequence is shown in Figure 2. 



Claims 

1. A process for the preparation of an isolated DNA 
polynucleotide comprising a fragment of at least 21 
bp from the gag or env region of the ARV-2 se- 
quence of Figure 2, wherein the DNA polynucle- 
otide is chemically synthesised at least in part. 

2. The process of claim 1 , wherein the DNA polynu- 
cleotide comprises a fragment of at least 21 bp from 
the gag region of the ARV-2 sequence of Figure 2. 

3. The process of claim 1 , wherein the DNA polynu- 
cleotide comprises a fragment of at least 21 bp from 
the env region of the ARV-2 sequence of Figure 2. 

4. The process of any preceding claim, wherein the 
DNA polynucleotide is labelled. 

5. The process of claim 4, wherein the DNA polynu- 
cleotide is labelled with an isotopic label. 

6. The process of claim 4, wherein the DNA polynu- 
cleotide is labelled with a non-isotopic label. 



Patentanspruche 

1 . Verfahren zur Herstellung eines isolierten DNA-Po- 
lynucleotids, das ein Fragment von mindestens 21 
bp aus der gag-Region Oder der env-Region der 
ARV-2-Sequenz gemaB Figur 2 umfaBt, wobei das 
DNA-Polynucleotid mindestens teilweise chemisch 
synthetisiert wird. 

2. VerfahrennachAnspruch 1 , wobei das DNA-Polynu- 
cleotid ein Fragment von mindestens 21 bp aus der 
gag-Region der ARV-2-Sequenz gemaB Figur 2 
umfaBt. 

3. VerfahrennachAnspruch 1 , wobei das DNA-Polynu- 
cleotid ein Fragment von mindestens 21 bp aus der 
env-Region der ARV-2-Sequenz gemaB Figur 2 
umfaBt. 

4. Verfahren nach einem der vorhergehenden Anspru- 



che, wobei das DNA-Polynuc!eotid markiert ist. 

5. Verfahren nach Anspruch 4, wobei das DNA-Polynu- 
cleotid mit einem isotopischen Marker markiert ist. 

5 

6. Verfahren nach Anspruch 4, wobei das DNA-Polynu- 
cleotid mit einem nicht-isotopischen Marker mar- 
kiert ist. 

10 

Revendications 

1. Proced6 pour la preparation d'un polynucleotide 
d'ADN isol6 comprenant un fragment d'au moins 21 

15 pb de la region gag ou env de la sequence ARV-2 
de la figure 2, dans lequel le polynucleotide d'ADN 
est chimiquement synth6tis6 au moins en partie. 

2. Proc6d6 selon la revendication 1 , dans lequel le po- 
20 lynucteotide d'ADN comprend un fragment d'au 

moins 21 pb de la region gag de la sequence ARV- 
2 de la figure 2. 

3. Proc6d§ selon la revendication 1 , dans lequel le po- 
25 lynucl6otide d'ADN comprend un fragment d'au 

moins 21 pb de la region env de la sequence ARV- 
2 de la figure 2. 

4. Proc6de selon Tune quelconque des revendications 
30 pr6c6dentes, dans lequel le polynucleotide d'ADN 

est marque\ 

5. Proc6d6 selon la revendication 4, dans lequel le po- 
lynucleotide d'ADN est marqu£ avec un marqueur 

35 isotopique. 

6. Proc6d6 selon la revendication 4, dans lequel le po- 
lynucleotide d'ADN est marqu6 avec un marqueur 
non isotopique. 
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CI »««aL««CUCl7it^tT*lat (|«StrPfeAr«riM>L««At*4Ufrpr«Uy«?4lV«l«lo«tiitj«*t»fb«S«r»roCt*l«lll«Pr»tl«tr*tS«rAt*l«» 16 1 

MB CA4AACCTACASC&KAAATCKY&CATCA(MCAtAVCA<CTA4i^f Tf AAATUAf M6TAAAABTAC1ACAAttAAAA££CT1 ICA4CCCACAA6TAATACCCAT8T1TTCA£CA1TA 

8 66 1CASAA66A4XCACCCCACAA6AT TIAAAXACCAItCTAAAtACAO TC ttSACAACA TCAA«A6CCATACAAATftTT»)><l>6*C>CT*TCAAT<l6<tt6CTtCAtAATCCfrATAftAtrC 

»t«frefa1ftMAI«6fpr>«1>4AlAtM6>yUaltatArB6l«»ro^^ l€l 
•IS CATCCA4T6XAT«A4*6<CTAtT«iCCIS6CtAAatO^ 

H«rr'otaU>76lolliTf rtr»^Trpt 1 »l**k»«ei jltaAtaLjt ArglUt Tir>trrraThrl«r II«L««Atpn a Ar«61*Clr^r«LytClorroFlMirflAip 101 

HQS AlCCCA67AC6A8A4A7CTAIA4AA4ATi4UlJJ»T4gTM6AT?A^ 

lilt TAT8TA6AXC64mTATAAJWC1M6*UC66ACAA6XHCU 

iiAB KltmilccuclicTlelct m 

!1*ll««N7tTTAA7^l7An7btAT^ All 
1466 6TA4TaV»aX46W60<^Wtt68^ ^ 

Cj«4trAreClQ6tj6lt6laXaUfsAapCj»Tft*Cf«Affl8taAl4Aaa^ ASt 

fb«ra«Arc)S1gAspl««Al«»B«La«€' " * ' 

U66 TtTCtlACBflAACfcAtACCAAAIftAAALATTtCACTCKUAfACCf TAATTTTTTACC&AAAA f CT66CCTTCCT At 



»terfca Ar? SI g As plaaAl a »Nt aaC 1 a«l atM A1 t Ar p 6 1 » rn S«r Sar ftlotl a TkrAfyA la > 1 
jl^tgtaAAtCCCISCftAATTTTCnCABACCAftACtAtABCCA 



T»rAltrr*?rail«tUIarPtoA*«*'b46ty*l»C1«LriTB?™^ »01 
AAaltrrrarfcr4sa4r*6lalaa61aVair*pS7/6>/€»oAi»As*S 6) 
1708 A6AtC€C6ACCAtAA6ACAB4^TCAAOTf T4^M6Jtt6 4 6AAA< C AA C T^ 

Ai a» ra ta Mar 61 aOC 

Ara'rotaaVamrnaAra|la61i61*6lala«t*lCtoAlakaaLa«Aa |01 
mi cIttCCTCQTUCAATAA6tATAtt U U C <A C TA^W 

fr)/«1/IUC) 7 0^^fc«lt#t7tf*Ur 0 C»-T/f'A»pCI»JUPro».UU!WC 7 »C»7ai»W7**»*I1tC»/T*rTOt«»»*»C>rrrorA^»/'OF*)A..jUrUO),Arfl 141 
1646 6646AA7 T66A66TTTTATCAAA6TAA6406TAC6^TCAfiAf ACCTCf AAA 

Aa*Uala»Tar6U|li61jCr*TWla«Aaark«*rallaSar6f^m 161 
1046 ATCTSnwCKACJlTT6«TT«TACTTTAJL*T?TCCCtATTJtfTCCTArr4AA«^ 

61si r tll*VytAUlt«fa1Clat>«CrtTBrCI«iMtSloLyA6U61yL>an«S«rlftl1a6l7»ro«Uata»roTfrA«t.TArf roVftl»kaAla|1aLmm7*4»9**r «» 
1166 JUbMUUTAAAA6tATTA6TA646ATAf61AtA4yUA7«lU»f6M6gtU^ 

tkrlyslrpAr«liUa»?4lA*prkaA>tf6lgla«4satj«A™T»^ 661 
1 166 C TAAA T66A6AAAACT A6T "»w»*"^ttinwii«i«ftri*cAefwfeecA>cmiitm«Aitif»tfi^»>wit^tii^ti«i;i.Ilii t ^ T ^ tffl fAT 

UaAipW61jAftpAlaT»r»kaSarVal»raLa«A*pt^aAi64>M6r^ 101 
1*16 T«4AT«T666T6Af6«AfA«rmCA<mCCTtAOATAAAaA<TnA6AAA8fAmT6AAn 

Laa*r«8Uftl»1r6lJtS1rlar6f^Ullar*a6taSartarta^ 14} 
1*46 T6^CJCA646AFA4AAA4<ArCAtCAAXAArATTCCAAAtTAA<ATftA<AAAAATCy fAAAtCC rrfTACAAAACACAA rCCAgA£ATAAfTATCTATCAATACATmATCAf TTgTATft 

fat«lyi4rA»plaaSlalU6lf61aai»A^T»v«^*tla6ta6ta4*aA>^ 161 
1168 TA66ATtT6A<m6AAATA»«A«ATA 6 aA C Aa M atA 6M aU m A 6 A C AC U f^^ 

Tr0AJat61/Tjriletaa6larraAi6l7»Tr»T»r»alSl»>r^ 4*1 
1788 88AT868TT6T8AACTCC6IC8T8A f AAA764ACAS^AtA6£tTATAATkXT6CC 1 6 1 1 ■ 668 AC >Cf T6<4<T6TCAAT0ArATACA4WUA877A6T6A^AAAAT76AATTtt6tAA6TC 

81a|laTaa4l*81ilWly»lalvj»«1alMCF»U*UatAaAM61»n^ 461 
1608 A^TTI6T6XA^twITA6A6TAAASTA6TTATC^ 



|lata«t*a6W6ralAlN1»6la?atT»rtfYto»r>ef*r^ Mi 
1028 TOT^AAAASAAC06TAA^TIiA6TA7ATTAT6AC^^ 



1U6 



la«l*»Ter«l»l7iTirrAl«4i^«Ai^ClrA>»»tf7kra**A*»?«ll^ 641 
tSLmA&MAJAQj? OTICQA^M 

•kai»ll«»**«ll^*«Vj* B laTfc#troClaAta?aptrpa*t81a?*r^^ Mi 
1666 MA66*TaXCC0T Ajt6JAM6i6^ 

Uaa1al»t61aa*alla»al61flla6»»?kr»4»T»»?alA»p*lfAlaAlU^ til 
1168 f M fi««-»««»r»*»"*»«f»*^«»«««»»™»-»«»^» *«^«^ 

|lailaAa#Tkrrk»A»aCUVl4T«*«1ata»6laUHatla61al^Al4*^ an 
1U8 tA6CT6ACMAAUAATCAamU64Anj*W 

61a»«Aapij**ar61o8«>nat0a»al8**Ol*llglla6T*^^ TQi 
1018 AAXCA86TA48A<y6AU>tCA6A6TtA^TCA87CAAATAATA8A6 

A*al«l40»il««P*»«CI»naA*§l»*»all«»6^L^ ;*j 
1746 AT4MTT06jfCA8I6CTOJ6^TC66 

araTrafAlfaUl.tjaCl-naiAlAlal^CrsAta^raC^I^^ 7»i 
1666 CA6Cf6TA6TA6CA4A66>AAIA6T44XCA4XT6T«^ 

8la6inf*IUUaiaa*amifalMafAlA»at*rtlar^ 611 
1068 AAA6AAAJLATTA tCC ItfiT A (CACTTCA T « T A6CCA AfUAt A TATI BM6T a 64 A6 T IAITC6i Cf I IT I f AC U44f A66AAAC 66C4 TAf TTTCTCTI AAAA ft A6CA66AA6A TC&C 



FIG. 2-1 



G 
A 
G 



P 

O 
L 



7 



EP 0 518 443 B1 



> !?l!!if ,U ' ,, »» ,, M#AApA»M»fl»#AiaMAfkrM#Tk»y^ • 
tlM CAITAAA*At*iUtAlKilU41UUltCAAItKiCCA«!«riCMtUAtlCteCCH1t6*TCt*Cl£ttAKAAUA4lJU!TTt«lTTCCClAXJLATtCCCAi^BTCAAt 

.... , '!!;!!! w, »*« r *»*»*»ftMiaWalfftltft^ toi 

•lit ««tl*U«CAAr<TATtt»MAAIftAAI7AAAIAAAAIlA7A<AAUS*TAACAWT 

.... tlfCI » ,, «* l » 6, /V»»»»«l«*1i*l»*r»H-»*IM»IUII»»l#f»#*»»H»*1»r»»l»lH»l»»CUtj»BI«IUT»rlf 1 1 1 *C I ftl to^t AM *■ M ,r T, r Ar, f«J 
iUt ••«C»AttCUtAA1ACAmCA6UAAAA»AATA«tAMCAlMTA«eAAM*^ 

Mu AC *MAAAftftyCCCC7iy>ftAftAA>MMCCAAA«yrCTCrftftJUU^^ 

••«• UMmyilAAAAtAMTC<*A«ftTM7AAT7ftTfc^iACAA67AMtAMAT6AO^ 

«f0« TFf F4f AClCArCACTATCAAAClACtCATCCAAUCTAMTTCMAAf TACACATCCCCCf A<fi(4AtSCT&AATTWf AA f AA£AA£AyAyTC4£4TCT&CATA<AMA4AAA0AAAA 

4111 lftMftyTy>ftMCAftftftA4TCftCCAU«MTftt*a*AAAM0JUAyAyM 

• •Aft AC tAT AAAAAAlCg tAl ATTABCA1ATACACT TAtf CCI ACAtCTfiA^TATtX4gf *CCA^ttJgAAtAtAt^TttCf AtAATAf "ITftCJlf TACf ley .TTtAt**^»^f » 

MM AtAAAICtACCITYCCC t A 8 TCY YAA tAAA C 70ACAU WA ¥ AAAI0 AAACAAACCCCAfiAAOACC AA£AACCACAAAM4ACCCAT AC AAT 6AAT lAACAX TAfiAGCf T1 T AAAACAOC 

ins vYAAucACA4Ccrc7TACACATr77tc7A<KCA7CttTecA7AM^ 

*»• AACTCt 1 CT 1 1 A TTCA 1 1 TC A4AATTUAT 8TCAACATASCA4AAT ACfiCAflATf CAAUSAfiSASA tCAAtAAAAAAt t£A Ct CAST A4ATCC T AATt t A4A&CCCTC&AA&C A tCC A 

Ml ft MAAftTCAftCClftWCUCTUlAACAATTtXTAntTAAAAJ^ 

>Mft C4AC 4A A£A£C 1 CC 1C A 44 AC AA7C AAAC7C A 7C AA4CT t C TC T A7C AAAACA47 AACT A47 A AAI BT AA T4CAATC T TT ACAAA YA 7T A4CAATAC7A TCATT A4TA47A4 Y ASC AAT A 

tin/U**CUUsTftrf«MltJUtLftV«l II 
Ifttft AiittAAUtttMttteA^itAciiftr itiAAtmta^iiAtitTttMfitiBiiiittiif iB*fT t tmitiei>t»i^Mi*i^ T t tM(fl ^ tTttr ^ Ttft ^ tTt 

Lyt4tyYkrArgAr|A a »t/r61aat»iA«?rpAroTrpC1j7art«*l«^ ji 
MM AAC&tCACCACtACtAATTATCAtCACTTATfttAfeATMCMAMTIftCICCrTMMTCTTIUlTMTCIftTA^f MTirifiAAAAITtftT6MTCACA<ytTATTATMJtftTI<CTftT0 

Trpt/t»liUl«riirTAr1krL««»k«Cy«Al«$trA»pAliAr»Al»TrrAspTkrCUt«1ftUAMr»1TrpAUThrBt»AI»CfiV*trri»TkrA«9rroAtA»rO«lafi)«V«l *1 
MOft Y4ftA^4AAtXAACYA*CftCtCTAYTY7ftT«ATMMY6tTAMBMTAraftttAW 

VAH«»AlyAt.«»llkr4l»Afa7k*A»»R*Mrpl^AtaAMa*tfk)«1a^^ 1)1 

MM ft 7AT 7C44 AAA rCTCACAAAAAA ri T 7AA4A7(?C4AAAAA7AACA7447ACAACA4ATA£A40ABAA7A7AA TCAOT7?A7ftA6A7CAAAA£t TAAACCCA7CT6TAAAAT7AACCCCA 

L*«<jt*«1 T»ri«*l t «C 7 tTArAlc>l«oClrL / *Al«T&r*t 0 Tt>rAt*S«rt«rAt>rrKr*C1.tl.I l«L/»C1rCt*IWt >7 VA*»<rit«r^Ai.n*TbfTbrl«^ll« 111 

• MS CYCY414YTftC77¥AAAYYtCMYM1¥Y4WM4MTACTMYMCM¥M7MTAAT70<ftftftCftlMftft7ftftftA« 

Af f A i f Lf * 1 1 «41 aljiClwAiail «l««Pk*ArfAs-atA«AtAV« > VaITy* I l«AspAi*A ) AlA^YkrYkr 7*^Aia7/r YkrAu T/fArgtA-vl 1*M1 tCrt AtaAf 4 If f V* 1 til 
«Mft AAACA1AA4AYYCACAAA4AAAAT4CACY17TTCGTAACCY1BAT47A07ACXAATA4ATAATCC tACtACTACtACCAACf Af ACCAAC1ATAC4TTCATACATTStAACAMTCAfiTC 

«)M AITAClCnttCYCTCCUACCTAYCAYYYttC^CMTVCCUYMATTAYYfiYACUUMYM^ 

Ai»V*ll*f Thr1»Ut ■C,»t»#-«l iCt y 1 1 *ArgFrol 1 1 VaI MrT&rSUiA«L««l*M»*«1y l«f I«»A1 «ftl ««1 *61»f «1 f « U 1 tAr^ttr Ai»Al*»tTA#A»6Aaa It I 
MM AAT4TCACCACA6IACAATC7A4ACAUtUITACMCAA7AeTCVUAXtCAACY 

AlAlytlftf |l«n«t»Ul*lt«Ai4^US«fniAUIUA»*Cy»TkrA^«rr<^»»AtttA«»T»rAr9ir«Mrll*T 7 rll*€ljr»roCl/A«^AI»>A a i1tTftrrAfCliArQ 111 
MM CC YAAAACCAYAA YAS7ACACCVCA4Y4AA7C MfAfiCAAf fAACfCTACAAMCCCAACAAMA IMAAMAAAMTAtCMTATAM^CAMMft^MMtTfMf ACAAMC^SAACA 

M.M,Clr*»»n.»r,l,tA|.liUC / .A%«l1«J«rAr 9 Al«Cl.Tr C ^t.A».1»a*^»^1-lUf*U7%^»t«»^*6t*«l»#»«Cl 7 A..At.l y .T»rtU»»^h« }|| 
« 7 4 ft A 7 AA 7 AS AA AA 7 A 7 A A ft A AAA &C A CA 7 T67AACA J T AO 7 AAA«4 AC AAt 4AAA7AMAC f I 7AOAACA0AYMY f AAAAAATTAACAftAACAQT T TMOAATAA lAAAACAA TM 7C TTT 



•ftM 



A«»€l»S*rSAf6lyClyA»P^oOI«IUTAlMtai»S«rM*A«aCytArgClyOI«rA»rA«TfrCf«A«*rAf toClal«*aMAkaftiaYk*frpAr«U«A»aa<«Ykrft1« All 
AATCAAttcTCAC4AC44MCCCA0A*A7Y4¥M7MMA4¥tTTAAYTtT A4AWAA A77 

4l>7*n?i4l/Ai«AipYk*|l»im»¥rroCr«AMlU 44>1 

ftftM fc^Af ^ICj^AIfcAgAfAAfgAlAgtegeAlfclACAAlAaA^gAAATtATA^ 

l»rAt*lWThrClst««a««l«W*rAr«A»p4l;C)j7brAt**»l7irAi»A»p7k^^ «») 
710ft YCAJUYAYYMAMt*¥tXTA7YAMAAlA^YMUStACAAAY4YAMTMTM 

Tyrt^%fAllWL7»IUCI»r>roMrf1»tWAI«»fvTftrl*«AlAtj»^ %J% 
fltft 7ATAAA67MlAAAMT70^CCATTaft6aAYA«MMMCAAC^AAA<AM ' . . 

AWAl«4lr*«rTfcMkf«4t7AtA««tWriMTarimfarV*1fttaAlaArM^ %t\ 
tU9 AtACtACAAAttAt TA 7K£ttUtlt1Kl£A71tA£tCT£AtC&l kfkMtt%£lf AAtlA1T&TCtEAtA1AA1 ttAtttMtt&AMthA t1Tt£*mA**ArT»r yQAtff{q{* fl {pp l { ^r 

tiatkaCIUa»Tfc*»Airrp«t7llal7iC1ata»ClaAlAArg»Atit^ All 

MAS C1C^7MAAgTtACA4TtYACg44A7CAAC*A4CYCCAAC*JU^A47Ctt6^ 

YkrV«rAI«fA>*roYrMtaAt«S«rfrpS«rAff«lrAMrt«^IMapllaYrpAipAi^^ »»l 

ffttft AefACfCtTeffcg^lTtAAAtAgfACtmAatAATAAAteigl gftilAAf I TlTC£AATAArAt AA^fTfcAAT>I£ACT £CKAa*r>n**> tTCAi-t« Tt . f . f . . ..^ 

W v la»ft)*MaMrMM»a«tA«la«t«tjrtAt^lo6t«ftY»W^^ AM 
1)03 fTACrTWAUAHO^MACUMAAftJUUACAATMACAAMAfTAUAMAT^ 

f »«|Ua»UUI«»«ly«lyli»»*ltlyt«»A« > tlU»«t^*«Al»f*U*vJ*rllAt*IAi»Artt •) Ar^Cl«61yT,rt*rrr»i«»Ur^A«CUnirAr«L*«^«t*l»ro 7J| 
»fttft tTtAfMTMfMTAl«6A^trMtAWTTlA>«AATMflTtT«T0f«TinTAtMT6AJ1A«A0T7M« 

Art4ly*r*A»pAMPr*Atp«lplU«lrtlMI«CtfCtr4l«ArftAtpA*^ 7>l 
7040 AC*C>«^CC4AXA6C*CC4M4*MTCC *AM^AAW 

LA«»*»Mrf7rA#M'at»^r«Aftpi*«t*ataall»AlAAWAsQrk»9«m til 
SOM C7CrTtAMYMtAXCMYTft^MA»ACyyMYtYYMYyi^ 

■ I.CI.l»al7*At*S«rA>*T*tl»rTrrL*«AuAUThrAUII«*l«T«irftr«UCljrArAi^r«T4l lt*tUfilAI^UAr f At.T/rArUWIUli.lt »M««1« All 

• IM CAAAAAt T AAAAAA T ASTftC Y ST TAAC T4t4 TC AAC ACCACAAC I A I AfcCA AT A_*C f CACAtA-AXAfcAf A££4T 7ATA6AAA1 A&CA£AAA(a£C7TA 7A£A(£ f AI 7C TCAACA 7 AAA 7 

• MS AMAMftnAMCAolw 

-MM .¥CWMMA4*AMjAjM4TMft*«M™W 

|S4| MAAMfttf MCt r t rCCAtrCAMCC TCA«TACC Tf f AMACCAA 7Mt7 YaCAA AAC AAA 7 7 7 A4AI A t f AMtACf f7f 7ft Aft AMftA IMKffl C7KAA0C4CT AArr ftXT 

•M> CCMA*MAA»>CaWMTCCMft*ICm^ L 

6*9? tqmYa«Tfttt*<yt M«tM«ft^ T 

$901 MAa^yytMC*faUM1*«*HfUnaMt8aXCCaWUtt^ R 

|S|| ct£CT AUC 74$ ACC^C ¥ A444A4744C4 7CCC YCA4A r«C Y4CA 7A lAAUAUUt 7177 74CC747AC74 M7C7CrC7C4rrAftACUMrCTM<CCTCaXA4<yCycy4«C. 

Mftk rAAttA440AMCeAC1«yiAAMCtCAATAAA«yi4tCft6A4lt<ytCA MTMt614rMCX47CyA17a7614A4fcy407AACIA4A4AlUCyCJk«AtCXyrirA4yCA47 

ttftt 4140AAAAAICtr?A4CAS 
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Hue l^o tide 
Positions 
Relative to 
Fiqurc 2 



3820 



3880 



3940 



4000 



4060 



4120 



41B0 



4240 



4300 



4 360 



4420 



FIG. 8-1 

WetX leAspLysAlaGln61uGluHisGluLysTyrHisSerAsnTrp 
1 A6GXAACA6: : : t ATGATr 6A: AAG6CACAA6AA6AACAT6A6AAATATCACAGTAATTG6 
TCCXTT6TC: : i ; TACT A i CT t TTCCGTGTTCTTCTT6TACTCTT TATAGTGTCATTAACC 

32 aboil, 36 nlattl, 

ArgAlaMetAlaSerAsoPheAsnLguProProValValAlaLvsGluIleValAlaSer 
. 62 A6AGCCATGGCTA6TGATTTTAACCT6CCACCTGTAGTAGCAAAAGAAATAGTAGCCAGC 
TCTC66TACCGATCACTAAAATTG6ACG6TGGACATCATC6TTTTCTTTATCATC66TCG 
ft o m 

66 ncol, 67 nlaill, 118 nspBH pvull , 119 alu1 9 

CysAspLysCys61nLeuLysGly61uAlattetHis61y61nValAspCysSerProGly 
122 T6TGATAAATGTCAGCTAAAAGGA6AAGCCAT6CATGGACAA6TAGACTGTAGTCCAGGA 
ACACTATTTACAGTC6ATTTTCCTCTTCGGTACGTACCTGTTCATCTGACATCAGGTCCT 

135 alul, 151 nlaill, 152 nsi! ava3, 155 nlaill-, 164 acd, 1 
76 apyl bstXl ecorll scrFl, 

X leTrpGl nLeuAs pCysThrMi sLeuGluGl yLys! 1 el leLeuVal A 1 aVal Mis Val 
182 ATATG6CAACTAGATT6TACACATCTAGAA66AAAAATTATCCT6GTAGCAGTTCATGTA 
TATACC6TT6ATCTAA CATG TG r A6ATCTTCCTTTTTAATA6GACCATCGTCAAGTACAT 

198 rsal, 205 icbal, 223 apyl ecorll serFl % 23* nla11l, 

AlaSerGl yTyr J leG 2 uAlaGluVa 1 1 leProA I a61uThrGlv61nG luThrA la Tyr 
242 GCCAGTGGATAT ATA6AAGCA6AA6TTATTCCAGCAGA6ACA6GGCA66AAACA6CATAT 
CGGTCACCTATATATCTTC6TCTTCAATAA66TC6TC7CTGTCCCGTCCTTTGTCGTATA. 

263 xmnl« 

PheLeuLeuLysL eu A ] aG 1 yAroTrpProVa 1 LysThrl 1 ©Hi sThrAspAsnG 1 ySer 
302 TTTCTCTTAAAATTAGCA66AA6ATGGCCAG7AAAAACAATACATACA6ACAATGGCAGC 
AAAGAGAATTTTAAT C6TCCTT CTACCGGT CATTTTT6TTAT6TATGTCTGTTACC6TC6 

321 mboll, 326 ball clPl hael, 327 haelll, 357 bbv i nu4h1 t 

AsnPheThrSerThrThrValLysAlaAlaCysTrpTrpAla61yIleLysGln61uPhe 
362 AATTTCACCA6TACTACG6TTAAGGCC6CCTGTT6GTGGGCAGGGATCAA6CA6GAATTT 
TTAAAGTG6TCATGAT6CCAATTCCGGCG6ACAACCACCCGTCCCTA6TTCGTCCTTAAA 

366 hph, 371 seal, 372 rsal, 385 haelll, 386 fnu4hl nsbll, 4 
05 binl, 406 dpnl sau3a t 

61 yl leProTyrAsnProGl nSerGlnGlyValVal6luSerf1etAsnAsnGluLeuLys 
422 GGCATTCCCTACAATCCCCAAAGTCAAGGA6TA6TA6AATCTATGAATAATGAATTAAAG 
C CGTAAG66AT6T T AGGGGT TTCA6TTCCTCAT CAT CTTAGATACTTATT ACTTAATTTC 

423 bsml, 458 hinf 1 , 

Lysl lei 1 eGl yG 1 nVal Arq AcpGlnAlaG I uH ieLeuLysThr A laVa 161 nMetAl a 
482 AAAATT ATA66ACAGGTAAGAGATCA6GCTGAACACCTTAAGACA6CAGT ACAAAT66CA 
TTTTAATATCCTGTCCATTCTCTA6TCC6ACTTGTGGAATTCTGTCGTCATGTTTACC6T 

S03 dpnl sau3a, 518 aflll, S30 rsal, 

ValPheIleHisAsnPheLysArQLys61y61yXte61yG2yryrSerAlaGly61uArg 
542 GTATTCATCCACAATTTTAAAA6AAAAG6G6GGATTGG6GGATACAGTGCA6GGGAAAGA 
CATAAGTAG6TGTTAAAATTTTCTTTTCCCCCCTAACCCCCTATGTCACGTCCCCTTTCT 

$47 fokl % 557 aha111 , 

IleValAspIleX 1 eAl aThrAsp! 1 eG XnThrUysGl uLeu6 InLysGl nX I eThrLys 
602 AT AGT AGACATAAT AGCAACAGACATACAAACTAAA6AACTACAAAAGCAAATTACAAAA 
TATCATCTGTATTATCGTTGTCT6TATGTTT6ATTTCTT6ATGTTTTCGTTTAATGTTTT 

60S acd , 

X leGl nAsnPheArg Val TyrTyrAra AspAsnLys As pProLeuTrptysG 1 yProA la 
662 ATTCAAAATTTTC6GGTTTATTACAG6GACAACAAA6ATCCCCTTTGGAAAG6ACCAGCA 
TAA6T TTTAAAA6CCCAAATAATGTCCCTGTTGTTTC7A6GGGAAACCTTTCC7GG TCGT 

697 *no2, 698 dpnl sau3a, 713 asui ava2, 
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